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Abstract 

In this note I briefly discuss ideas related to the so-called fourth-root trick. 
A decomposition of the "rooted" fermion effective action into Wilson fermions 
and a nonlocal, lattice spacing suppressed functional is presented, complete 
with link interactions. Some proposals are given for analytical, nonperturbative 
studies of the fourth-root trick. 



*Presently : giedtOphysics . umn . edu 



1 Introduction 

Lattice QCD with improved staggered fermions (SF) has recently enjoyed publicity 
for its ability to correctly reproduce many aspects of hadron physics with reasonable 
accuracy [1,2]. However, criticism has been leveled at the approach [3-5] due to, 
among other things, the use of the fourth-root trick (FRT). In this note, I briefly 
review aspects of this issue, and mention some ideas in relation to it. 

The FRT is used because staggered fermions, or, Kogut-Susskind fermions [6-8], 
do not entirely overcome the fermion doubling problem. Rather, they reduce the 
number of continuum modes from 16 to 4. These 4 modes are referred to as tastes, 
so as to distinguish them from the Nf flavors in the continuum (target) theory. To 
estimate the fermion measure of Nf continuum flavors, one takes the power Nf/ A of 
the fermion determinant in the definition of the functional integral. This is what I 
will refer to as the fourth-root trick, although it is often called the square-root trick 
since in QCD two light flavors are used. 

In the case of free staggered fermions, the eigenvalues Uk of the fermion matrix 
M S f are 4- fold degenerate, corresponding to symmetries that relate the 4 tastes. 
Thus there appears to be nothing wrong with the FRT: 



That is, one effectively weights by the eigenvalue spectrum of some other "fermion 
matrix" M e ff. It has been shown by Shamir [9] that RG blocking allows one to write 
(as suggested in [10]) 



Here, D RG is a local lattice Dirac operator with a single flavor. T is a local operator 
that only contains UV modes. Thus nonlocality contained in det T 1 / 4 factorizes into 
a harmless overall factor once the UV modes are integrated out. The long-distance 
effective action is local. 

The problem is that one is not interested in free staggered fermions. Rather, in 
QCD the gauge- covariant staggered fermion operator contains interactions through 
the link fields U^r). For a generic configuration of the link fields, the 4-fold degen- 
eracy of the eigenvalues of the fermion matrix is destroyed: 




(1.1) 



det M eff = det D RG det T 1/4 . 



(1.2) 
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In this circumstance, one has a right to worry about the FRT. 

The splitting of eigenvalues occurs due to taste violating (TV) interactions. In an 
expansion of link bosons about the identity, these arise from higher lattice-derivative, 
irrelevant operators. That is, the effect is suppressed by a power of the lattice spacing 
a that depends on the level of improvement of the lattice action, and increases with 
the amplitude of UV components of the link configuration. The asymmetric mixing 
of tastes that occurs in the presence of a nontrivial link configuration lifts the 4-fold 
degeneracy. Numerical studies of these splittings have been performed for dynamically 
generated link configurations. For example, in [11] it was seen that smearing of links — 
which reduces the TV effects by smoothing out UV link fluctuations — can render 
the IR eigenvalues "near degenerate (sic)." TV effects are regarded as a source of 
systematic error that can be reduced with enough effort. Since it has been observed 
that many physical observables are insensitive to a truncation of UV eigenvalues of 
the lattice Dirac operator (e.g. [12]), the nonlocalities associated with "rooting" a 
spectrum with nearly degenerate IR eigenvalues and nondegenerate UV eigenvalues 
are perhaps harmless. While this sort of argument is somewhat reassuring, clearly it 
would be preferable to rigorously address the consistency of the "rooted" theory. 

Improved staggered fermions are used because they are efficient, cost effective, and 
seem to give good results. This raises simple questions that we would like to answer: 
Why does the FRT work in practice? At what point might it fail? For instance, 
is there a limit on its applicability due to a breakdown of unitarity at some short 
distance scale? 

I now summarize the remainder of this note: 

• In ^21 I outline various perspectives on the FRT. Firstly, in £ 12.11 1 remark on 
the effects of the FRT in perturbation theory. I reach the conclusion which 
has been stated many times: the FRT poses no problem for weak coupling 
perturbation theory about the = vacuum. I present details as to why 
this is a reasonable conclusion. Secondly, in £ 12. 21 1 speculate as to analytical, 
nonperturbative studies that could be done to study the FRT. Thirdly, in £12.31 
I present an unusual decomposition of the "rooted" fermion effective action, 
in terms of Wilson fermions, and a nonlocal TV operator that has explicit 
suppression by the lattice spacing. Applications of this decomposition, which 
isolates the taste-violating part of the theory, are left to future work. 

• In ^21 I discuss some details of the staggered fermion action. A translation into 
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the taste basis, without recourse to expansion about = 1, is described. The 
explicit form of the decomposition into Wilson fermions and a TV correction is 
given in the taste basis. 

• In £0J I conclude with a summary. Directions for future research are recapitu- 
lated. 

2 Perspectives 

2.1 The perturbative view 

Let me begin with conventional perturbation theory. One expands about the U^(r) = 
exp(iagAn(r)) — > 1 limit. First I write 

M SF (U) = M SF (1) + \M SF {U) - M SF {1)) = M SF (1) + AM SF (U). (2.1) 

Note that AMs F (U) = OfagA^) and contains, among other things, the minimal 
fermion-boson vertex. With the FRT applied, the fermion measure is represented by 
the following effective contribution to the action: 

Seff = -^-Tr\n[M SF (l) + AM SF (U)] 
Nf 

= — -± Tr In [1 + Ms F {l)AM SF {U)] + const. (2.2) 

One expands exp(— S e ff) in AMs F (U) to obtain a series of terms, each Tr multiplied 
by Nf/4. One further expands AMsf{U) in terms of = expiagA^ to obtain weak 
coupling perturbation theory and the continuum limit. Since each Tr corresponds to 
a fermion loop, the effect of the FRT is seen to be as follows: each diagram of ordinary 
SF perturbation theory is multiplied by (Nf/4) n , with n the number of fermion loops. 

If perturbation theory is consistent at Nf = mod 4 (i.e., for ordinary SF pertur- 
bation theory), then it is difficult to see how any problems would arise for values of Nf 
in-between. E.g., let Zi, m, g, . . . be chosen to renormalize the theory for Nf = mod 
4. These then become functions of Nf/4, with a natural extension to arbitrary Nf. It 
is conceivable that cancellations between divergent diagrams happen at Nf = mod 
4 but not at other values. That would indicate the need for additional counterterms; 
one might worry whether or not they are always local, given the apparent nonlocal 
nature of the rooted SF matrix. However, the symmetries of the SF action are still 
operative with the FRT applied, so the form of counterterms is similarly restricted. 
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Also, it can be seen that the perturbation series is constructed entirely from local 
vertex operators, since the only change is a factor of Nf/A for each fermion loop. 
This is enough to exclude the possibility of nonlocal counterterms. 

Finally, renormalization of composite operators could be differerent when Nf ^ 
mod 4, due to a lack of certain cancellations. But, I have no example to cite. 

The conclusion I draw from these considerations is in agreement with "standard 
lore" : the FRT poses no problem in perturbation theory. 

2.2 Nonperturbative ideas 

Perhaps nonperturbative effects of the FRT might be accessible through an instanton 
calculus, or some other semiclassical expansion. Whereas it has been argued above 
that the FRT poses no problems for perturbation theory about the = vacuum, 
perturbation theory about some other background could conceivably yield different 
conclusions. The point is to compare to results obtained from other approaches 
in a similar regime: from the continuum, Ginsparg- Wilson fermions, etc. Finding 
any discrepancy and understanding its origin would teach us valuable things. Finite 
system volume, perhaps in conjunction with twisted boundary conditions, could help 
to exert greater theoretical control over such calculations. 

It might also be of interest to look at strong coupling expansions with the FRT 
applied. Some nonperturbative features, such as chiral symmetry breaking, can be 
studied by this approach; e.g. [13]. One could compare expectations for the spectrum 
of mesons and baryons for a theory with Nf flavors to what occurs in the strong 
coupling limit with fermion measure (|2.2|) . 

2.3 The Wilson fermion decomposition 

In this decomposition, I organize the effective fermion measure in such a way that 
TV effects are isolated for systematic study, and suppression by the lattice spacing is 
explicit. 

Let My/F be a single flavor Wilson fermion (WF) matrix. Then I decompose the 
staggered matrix as follows: 

M SF = M WF ®U + aM TV , (2.3) 

where Mtv contains all the TV effects. This decomposition is possible because in the 
taste (flavor) basis for staggered fermions [14,15], the terms that are not suppressed 
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by a are just four identical flavors of the naive lattice Dirac matrix. (With gauge 
interactions included, some reinterpretation of link variables is required; c.f. 1)3. 7J) . 
This will be explained in detail below.) Once this decomposition has been effected, I 
note that 

det M^/ /4 = det M% F exp Tr In [l + a{M^ F <g> U)M TV ] . (2.4) 

It can be seen that this arrangement sequesters all the TV effects and nonlocality into 
0(a) correction terms. 1 The fermions of the theory now consist of Nf degenerate 
flavors, each with a conventional matrix My/F- The correction term is a nonlocal 
functional of the the link fields. 

One disadvantage of the Wilson fermion decomposition is that it will typically 
obscure the staggered fermion symmetries, such as the one that prevents additive mass 
renormalization. These are hidden in the induced tranformations of the correction 
term. 

Finally, I remark that the Wilson fermion is not the unique choice for the type of 
decomposition described here. Any other fermion that consists of the naive fermion 
plus 0(a) terms would also work. It remains to be shown that this decomposition 
has a useful application. I will not do that here, but hope to find one in the future. 



3 Taste basis details 

The fermion action I start with is the single-flavor staggered fermion, written in 
the conventions of Kluberg-Stern et al. [15]. It is just (with obvious notations and 
summation conventions) 

S S f = -^a 3 a M (r) [x(r)U^(r)x(r + fi) + x(r + //)L/J(r)x(r)] 

+ ima 4 x( r )x( r )- (3-1) 

The phases are, as usual, a M (r) = (— l)^<^ Tv . One associates a (2a) 4 hypercube 
with each site r = 2y, where y has integral entries. Sites contained in the hypercube 
are labeled on the original lattice by 

r = 2y + V , V EK = {(0 4 ), (1^), ( 1 2 ,0 2 ), (1^0), (l 4 )}. (3.2) 

1 I note that the operator [(M^f <g> I^Mtv} 1 ^ 4 is the same one that Adams has studied in [10]. 
In this respect, his proposal is related to the one presented here. 
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Powers indicate how many times a or 1 appears and underlining indicates that all 
permutations of entries are to be included. 

To proceed further, one defines fermion fields for each point in the hypercube and 
transforms to the (covariant, position space) taste basis: x( r )>x( r ) — * <? Qa (2/)> 
Since this is all very well-known, I just summarize the ingredients: 

X (2y + V ) = (-l)^ Xr) (y), xi^V + v) = (-1) E "^X,(Z/), 

r, = iTififiT, 7,} = -2<W, 

U M (y) = U,(2y + rj), T^, = ~ Tr (r^), 

U v (y) = XJf(2y)XJf(2y + r ll )Uf{2y + Vl + r^V? (2y + Vi + V2 + Vs), 
XM = 2Ul(y)Tr (r{ 9 (y)) , xM = 2Tr (T v q(y))U v (y). (3.3) 

Using various well-known identities, I have found that (j3.1)l becomes: 

S SF = 2a 3 {q{y)C^y)q{y + jl) + q{y + p)Cl{y)q{y) 

+ q(y) (A(y) + A\y)) q{y) + 8iamq(y)(l ® l)q(y)} . (3.4) 

I have defined the following link-dependent structures (in these two equations all sums 
are explicit): 

Greek and latin indices correspond to spinor and taste labels respectively. These have 
been suppressed in (13.4)1 . but are easy to put back in. For instance, q(y)A(y)q(y) = 
q aa (y)A al3 ' ab (y)q l3b (y). 

It is not difficult to show that (on the original lattice) A(y) is a parallel transporter 
from 2y back to 2y along a sum of paths. These paths traverse the (2a) 4 hypercube 
that is associated with the site y of the doubled lattice. A(y) thus transforms as a 
site variable at y on the doubled lattice. C^y) transports from 2y to 2(y + ft) along 
a sum of paths in the hypercube. It is thus a link variable from y to y + p, on the 
doubled lattice. This is consistent with the transformation properties of the quark 
fields appearing in ()3.4|) . 

Consider the action obtained by expanding in ag the links U^r) = exp[iaoA /U (r)] 
that appear implicitly through A(y) and C^(y) in ()3.4j) . Kluberg-Stern et al. give 
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this continuum approximation to the interacting theory up to 0(a 2 ) corrections [15]. 
This exposes the leading irrelevant operators, which are suppressed by a single power 
of a. It is a simple matter to decompose their expression into 4 degenerate tastes of 
WFs (lattice spacing 2a) and TV terms: 

SsF = SwF(A) + CiStV 

= (2a) 4 g(y) [M wm + aM TV ] q(y), 
M WF = 7^L> M - iaD^D^ + im, M W f(4) = M WF <g> 1 4 , 



TV 



T 



(7m - 7.) ® U + ^71[7m, 7,] ® + * v ) t 4)- ( 3 - 6 ) 



Here, is the gauge-covariant derivative and F^ v is the field-strength. 

In the same spirit, I can make Wilson fermions manifest in (|3.4j) . working entirely 
in terms of link variables. I define links that connect sites of the doubled lattice: 

UM = 11^)11^ + ji). (3.7) 

This permits one to define the action of 4 flavors of WF on the doubled lattice: 

S W f{a) = 4a 3 {q(y) {{i + 7(U ) <g> 1) U^(y)q(y + jX) 

+ q(y + A) ((* - 7„) ® l)Ul(y)q(y) + i{ima - 8)g(y)(l ® l)q(y)} ■ (3.8) 

If I add and subtract this from 1)3.4)1 . I then have Ssf = SVf(4) + aSrv, where 

S T y = (2a) 4 {q{y)C^y)q{y + p) + q{y + p)Cl(y)q(y) + q{y)A{y)q{y) } , 

1 9i 
^(y) =E —[A(y) + A\y)}+-(1®1), 

= ^C,(y)-±((i + ^)®l)U^y). (3.9) 



4 Discussion 

I have given an argument in support of the conclusion that the FRT poses no problem 
in perturbation theory about the = vacuum. The consistency of the FRT needs 
to be studied further at a nonperturbative level. 

I have suggested a semiclassical analysis about nontrivial link configurations; for 
example, ones that have nonzero topological charge. Perhaps that might allow for an 
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examination of nonperturbative TV vis-a-vis the fourth-root trick. The point is to 
compare to results obtained from other approaches in a similar regime. It would also 
be interesting to examine the strong coupling limit. Studies in these directions are 
currently in progress. 

It is not clear to me whether or not the questions of global topology suggested 
in [5] can be addressed by semiclassical or strong coupling methods. However, many 
other important questions of consistency may be accessible through the techniques 
envisaged here. 

Finally, I hope to present some useful applications of the Wilson fermion decom- 
position at a later date. 
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